Transcribing audio communication sessions

ABSTRACT

A computer-implemented method to provide transcriptions of an audio communication session is disclosed. The method may include receiving audio data at a first device from a second device over a first wireless network connection of a first wireless network type and providing the audio data to a speaker of the first device for output of the audio data. The method may further include encoding the audio data at the first device based on a second wireless network type and transmitting the encoded audio data from the first device over a second wireless network of the second wireless network type.

CROSS-REFERENCE TO RELATED APPLICATION

This patent application is a continuation of U.S. patent applicationSer. No. 15/342,969, filed Nov. 3, 2016, which is a continuation of U.S.patent application Ser. No. 15/221,278, filed Jul. 27, 2016, now U.S.Pat. No. 9,497,315, both of which are incorporated herein by reference.

FIELD

The embodiments discussed herein are related to transcribing audiocommunication sessions.

BACKGROUND

Modern telecommunication services provide features to assist those whoare deaf or hearing-impaired. One such feature is a text captionedtelephone system for the hearing impaired that uses a relay system. Arelay system may be a telecommunication intermediary service that isintended to permit a hearing-impaired user to utilize a normal telephonenetwork.

The subject matter claimed herein is not limited to embodiments thatsolve any disadvantages or that operate only in environments such asthose described above. Rather, this background is only provided toillustrate one example technology area where some embodiments describedherein may be practiced.

SUMMARY

A computer-implemented method to provide transcriptions of an audiocommunication session is disclosed. The method may include receivingaudio data at a first device from a second device over a first wirelessnetwork connection of a first wireless network type and providing theaudio data to a speaker of the first device for output of the audiodata. The method may further include encoding the audio data at thefirst device based on a second wireless network type and transmittingthe encoded audio data from the first device over a second wirelessnetwork of the second wireless network type.

BRIEF DESCRIPTION OF THE DRAWINGS

Example embodiments will be described and explained with additionalspecificity and detail through the use of the accompanying drawings inwhich:

FIG. 1 illustrates an example system configured to providetranscriptions of an audio communication session;

FIG. 2 illustrates example operations in a system configured to providetranscriptions of an audio communication session;

FIG. 3 illustrates another example system configured to providetranscriptions of an audio communication session;

FIG. 4 illustrates an example audio output device;

FIG. 5 illustrates an example system that may be used in providingtranscriptions of an audio communication session;

FIG. 6 is a flowchart of an example computer-implemented method toprovide transcriptions of an audio communication session; and

FIG. 7 is a flowchart of another example computer-implemented method toprovide transcriptions of an audio communication session.

DESCRIPTION OF EMBODIMENTS

Some embodiments in this disclosure relate to a method and/or systemthat may transcribe audio communication sessions. In some embodiments, amobile device, for example a smart phone, may begin a communicationsession over a cellular network. A user of the mobile device may behearing-impaired. As a result, the user may not be able to fullyunderstand audio received during the communication session. The audiomay be voice data generated by another device participating in thecommunication session. In these and other embodiments, the audioreceived over the cellular network communication session may betransferred to an audio output device over a second wireless network.For example, the audio output device may be a Bluetooth headset and thesecond wireless network may be a Bluetooth network.

The audio output device may be configured to multiplex the receivedaudio. The multiplexed received audio may be sent to a speaker of theaudio output device to be broadcast to the user. The multiplexedreceived audio may also be encoded for transmission over a thirdwireless network to a transcription system. In these and otherembodiments, the transcription system may be configured to transcribethe audio and send a transcript of the audio to the mobile device forpresentation to the user. The transcript may assist the user inunderstanding the audio and better participate in the communicationsession.

In some embodiments, the systems and/or methods described in thisdisclosure may help to enable the transcription of audio received at amobile device or other devices during a communication session. In othersettings, audio received at devices during a communication session of acellular network may not be easily captured by an application running onthe device. For example, the audio from a phone call on a cellularnetwork using a smart phone or other cellular phone is typicallyisolated from third party applications that are executed by the smartphone. As a result, the audio from the phone call may not be easily, orin some circumstances may not be able to be, multiplexed and transferredto a transcription system to generate a transcript to assist the hearingimpaired during the phone call. The systems and/or methods described inthis disclosure may assist in enabling the capture and generation of atranscript of audio from a phone call on a cellular network using acellular enabled device. Thus, the systems and/or methods provide atleast a technical solution to a technical problem associated with thedesign of cellular enabled devices in the technology oftelecommunications.

Turning to the figures, FIG. 1 illustrates an example system 100configured to provide transcripts of an audio communication session. Thesystem 100 may be arranged in accordance with at least one embodimentdescribed in the present disclosure. The system 100 may include anetwork 102, a mobile device 110, an audio output device 120, an accesspoint 130, a transcription system 140, and a device 160.

The mobile device 110 may be any electronic or digital device. Forexample, the mobile device 110 may include a laptop computer, asmartphone, a mobile phone, a tablet computer, a phone console, or otherprocessing device. The mobile device 110 may be configured to transferdata over a first wireless network 150, a second wireless network 152,and a third wireless network 154. For example, the mobile device 110 maycommunicate with the access point 130 over the first wireless network150 and may communicate with the audio output device 120 over the secondwireless network 152 and the third wireless network 154. In someembodiments, the first wireless network 150, the second wireless network152, and the third wireless network 154 may be different types ofwireless networks.

In some embodiments, the first wireless network 150 may be a cellularwireless network. For example, the first wireless network 150 mayprovide voice services, data services, messaging services, and/or anysuitable combination thereof. The first wireless network 150 may includea Frequency Division Multiple Access (FDMA) network, an Orthogonal FDMA(OFDMA) network, a Code Division Multiple Access (CDMA) network, a TimeDivision Multiple Access (TDMA) network, and/or any other suitablewireless communication network. In some embodiments, the first wirelessnetwork 150 may be configured as a third generation (3G) wirelesscommunication network and/or a fourth generation (4G) wirelesscommunication network. In these or other embodiments, the first wirelessnetwork 150 may be configured as a long-term evolution (LTE) wirelesscommunication network.

The second wireless network 152 and the third wireless network 154 maybe configured as short-range wireless networks, such as a wireless localarea network (LAN), a personal area network (PAN), or a wireless meshnetwork (WMN). For example, the second wireless network 152 and thethird wireless network 154 may be networks that use Bluetooth Class 2and Class 3 communications with protocols that are managed by theBluetooth Special Interest Group (SIG). Other examples of wirelessnetworks may include the IEEE 802.11 networks (commonly referred to asWiFi), Zigbee networks, among other types of LANS, PANS, and WMNS.

In some embodiments, the mobile device 110 may be configured to receiveand transmit data over the first wireless network 150. For example, themobile device 110 may be part of a communication session with the device160 by way of the access point 130. In these and other embodiments, thecommunication session may be a phone call. As such, the communicationsession may be using protocols in the first wireless network 150 thatare used for phone calls and that are handled differently within themobile device 110 than other data transfers. For example, thecommunication session may be a cellular phone call that is not a Voiceover Internet Protocol (VoIP) phone call. Alternatively or additionally,the communication session may be established using built-in callingfunctionality in the mobile device 110 that is not provided by athird-party application.

In some embodiments, the mobile device 110 may be configured to receiveand to transmit data over the second wireless network 152 and the thirdwireless network 154 to communicate with the audio output device 120. Inthese and other embodiments, the mobile device 110 may be configured toreceive and transmit data over the first wireless network 150, thesecond wireless network 152, and the third wireless network 154 duringoverlapping time frames. In some embodiments, the mobile device 110 mayreceive and transmit data in real time over each of the first wirelessnetwork 150, the second wireless network 152, and the third wirelessnetwork 154. In these and other embodiments, the mobile device 110 mayreceive and transmit data in real time over each of the first wirelessnetwork 150, the second wireless network 152, and the third wirelessnetwork 154 in substantially overlapping time intervals.

In some embodiments, an application associated with the transcriptionsystem 140 may be executed by the mobile device 110. In these and otherembodiments, the application may be configured to communicate with thetranscription system 140 over the first wireless network 150 and thenetwork 102. In these and other embodiments, the application may beconfigured to provide audio data to the transcription system 140 and toreceive transcripts of the audio data from the transcription system 140over the first wireless network 150.

The audio output device 120 may be any electronic or digital device. Insome embodiments, the audio output device 120 may be configured toreceive and to transmit data over the second wireless network 152 andthe third wireless network 154 to communicate with the audio outputdevice 120. In some embodiments, the audio output device 120 may includea speaker and/or microphone. In these and other embodiments, the audiooutput device 120 may be configured to output data from the mobiledevice 110 by way of the speaker. In these and other embodiments, outputof the data may include providing the data to a speaker. The speaker mayact as a transducer to convert the data from analog electrical signalsto sound waves that are broadcast by the speaker. Alternatively oradditionally, the audio output device 120 may be configured to providedata to the mobile device 110 that is generated by the microphone basedon sounds waves received by the audio output device 120.

In some embodiments, the audio output device 120 may also be configuredto multiplex the data received from the mobile device 110 and/orgenerated by the audio output device 120. In these and otherembodiments, the audio output device 120 may receive data by way of thesecond wireless network 152, may multiplex the data to output the databy way of speaker, and may transmit the data back to the mobile device110 using the third wireless network 154. Alternatively or additionally,the audio output device 120 may generate data using the microphone. Theaudio output device 120 may multiplex the generated data and may providethe data to the mobile device 110 using the second wireless network 152and the third wireless network 154.

The access point 130 may be any suitable wireless network communicationpoint and may include a base station, an evolved node “B” (eNB) basestation, a remote radio head (RRH), or any other suitable communicationpoint. The access point 130 may be configured to establish the firstwireless network 150 and to communicate data between devices. Forexample, the access point 130 may include protocol to communicate withthe mobile device 110 and to maintain the first wireless network 150. Insome embodiments, the access point 130 may be configured to relay databetween the mobile device 110 and other devices in the system 100, suchas the device 160 and the transcription system 140.

In some embodiments, the network 102 may be any network or configurationof networks configured to send and receive communications betweendevices. In these and other embodiments, the network 102 may beconfigured to communicatively couple the device 160 to the access point130 and the access point 130 to the transcription system 140.

In some embodiments, the network 102 may include a conventional typenetwork, a wired or wireless network, and may have numerous differentconfigurations. Furthermore, the network 102 may include a local areanetwork (LAN), a wide area network (WAN) (e.g., the Internet), or otherinterconnected data paths across which multiple devices and/or entitiesmay communicate. In some embodiments, the network 102 may include apeer-to-peer network. The network 102 may also be coupled to or mayinclude portions of a telecommunications network for sending data in avariety of different communication protocols. In some embodiments, thenetwork 102 may include cellular communication networks for sending andreceiving communications and/or data including via hypertext transferprotocol (HTTP), direct data connection, wireless application protocol(WAP), e-mail, etc. The network 102 may also include a mobile datanetwork that may include third-generation (3G), fourth-generation (4G),long-term evolution (LTE), long-term evolution advanced (LTE-A),Voice-over-LTE (“VoLTE”) or any other mobile data network or combinationof mobile data networks. Further, the network 102 may include one ormore IEEE 802.11 wireless networks. In some embodiments, the network 102may be the same or different than the first wireless network 150, thesecond wireless network 152, and/or the third wireless network 154. Inshort, the network 102 may be any type or combination of networks thatare configured to provide communications between devices.

In some embodiments, the device 160 may be any electronic, analog, ordigital device. For example, the device 160 may include a laptopcomputer, a smartphone, a mobile phone, a tablet computer, a phoneconsole, a public switched telephone network (PSTN) phone, or any othercommunication device. The device 160 may be configured to participate ina communication session with the mobile device 110. In the communicationsession, data, such as audio data may be exchanged between the device160 and the mobile device 110 by way of the network 102 and the firstwireless network 150. In these and other embodiments, the network 102that communicatively couples the device 160 and the access point 130 maybe the same as or different than the first wireless network 150.

In some embodiments, the transcription system 140 may include anyconfiguration of hardware, such as processors, servers, and databasesthat are networked together and configured to generate transcripts bytranscribing audio data received over the network 102. For example, thetranscription system 140 may include multiple computing systems, such asmultiple servers that each include memory and at least one processor,which are networked together and configured to perform operations asdescribed in this disclosure, among other operations. In someembodiments, the transcription system 140 may includecomputer-readable-instructions that are configured to be executed by thetranscription system 140 to perform operations described in thisdisclosure.

In some embodiments, the transcription system 140 may be configured totranscribe audio data received from the mobile device 110 to generatetranscripts of the audio data. In some embodiments, the audio data maybe data received by the mobile device 110 over the first wirelessnetwork 150, such as from the device 160. Alternatively or additionally,the audio data may be generated by the audio output device 120 and maybe received by the mobile device 110 from the audio output device 120over the second wireless network 152 or the third wireless network 154.

In some embodiments, the transcription system 140 may be configured toprovide the generated transcripts to the mobile device 110 over thenetwork 102 and the first wireless network 150. In these and otherembodiments, the mobile device 110 may be configured to display thegenerated transcripts on a display of the mobile device 110.

An example of the operation of the system 100 follows. A communicationsession may be established between the mobile device 110 and the device160 by way of the access point 130 using the first wireless network 150and the network 102. For example, the mobile device 110 may call thedevice 160 to establish the communication session. Alternatively oradditionally, the device 160 may call the mobile device 110 to establishthe communication session.

The mobile device 110 may be used by a user that is hearing-impaired. Asused in the present disclosure, a “hearing-impaired user” may refer to aperson with diminished hearing capabilities. Hearing-impaired usersoften have some level of hearing ability that has usually diminishedover a period of time such that the hearing-impaired user cancommunicate by speaking, but that the hearing-impaired user oftenstruggles in hearing and/or understanding others.

During the communication session, the mobile device 110 may receiveaudio data over the first wireless network 150 from the access point130. The audio data may originate from the device 160. For example, theaudio data may be based on a voice signal from a user of the device 160.For example, the voice signal may be words spoken by the user of thedevice 160 during a conversation with the user of the mobile device 110during the communication session.

The mobile device 110 may transmit the audio data over the secondwireless network 152 to the audio output device 120. The audio outputdevice 120 may multiplex the audio data to output the audio data by wayof the speaker included in the audio output device 120 and to transmitthe audio data over the third wireless network 154 to the mobile device110.

The mobile device 110 may receive the audio data from the audio outputdevice 120 by way of the third wireless network 154. The mobile device110, by way of an application executed by the mobile device 110 andassociated with the transcription system 140, may send the audio datareceived from the audio output device 120 over the first wirelessnetwork 150 to the transcription system 140. The transcription system140 may transcribe the audio data to generate a transcript of the audiodata. In some embodiments, to transcribe the audio data, a callassistant may listen to the audio data and “revoice” the words of theaudio data to a speech recognition computer program tuned to the voiceof the call assistant. In some embodiments, the transcript may begenerated by the speech recognition computer.

The transcript of the audio data may be provided over the first wirelessnetwork 150 to the mobile device 110. In some embodiments, the mobiledevice 110, by way of the application associated with the transcriptionsystem 140 and an electronic display, may display the transcript whilethe hearing-impaired user carries on a normal conversation with a userof the device 160 during the communication session. The transcript mayallow the hearing-impaired user to supplement the audio data receivedfrom the device 160 and confirm his or her understanding of the wordsspoken by the user of the device 160.

The system 100 may be configured to provide the transcript of the audiodata in substantially real-time or real-time. For example, thetranscript of the audio data may be provided to the mobile device 110 inless than 2, 3, 5, or 10 seconds after the audio data is presented tothe user of the mobile device 110 by the audio output device 120.

As described, the system 100 may be configured to provide transcripts ofaudio data received by the mobile device 110. As discussed previously,in some circumstances, the mobile device 110 may be configured such thatan application associated with the transcription system 140 executed bythe mobile device 110 may be unable to obtain the audio data. As aresult, the mobile device 110 may be unable to provide the audio data tothe transcription system 140 for a transcript of the audio data. Thesystem 100 may provide a manner for a transcript to be provided to themobile device 110.

In some embodiments, the device 160 may be associated with a hearingimpaired user and may be in communication with the transcription system140. In these and other embodiments, the audio output device 120 maycapture words spoken by the user of the mobile device 110 as audio data.The audio output device 120 may send the audio data to the mobile device110 over the second wireless network 152 and/or the third wirelessnetwork 154. The audio data transmitted over the second wireless network152 may be sent to the device 160 as audio data for presentation to theuser of the device 160 as normal audio data. The mobile device 110, byway of an application associated with the transcription system 140, maytransmit the audio data received over the second wireless network 152and/or third wireless network 154 to the transcription system 140 withan indication that the transcript be provided to the device 160. Thetranscription system 140 may transcribe the audio data and provide thetranscript to the device 160. The device 160 may present the transcripton a display of the device 160 for the hearing-impaired user of thedevice 160. Alternatively or additionally, the device 160 may beassociated with an audio output device 120 and may receive transcriptsof audio data originating from the mobile device 110 in a manneranalogous to how the mobile device 110 receives transcripts.

Modifications, additions, or omissions may be made to the system 100without departing from the scope of the present disclosure. For example,in some embodiments, the system 100 may include additional devicessimilar to the mobile device 110. Alternatively or additionally, thedevice 160 may be paired with another device that is similar to theaudio output device 120. Alternatively or additionally, one or more ofthe communication couplings between the mobile device 110 and the audiooutput device 120 may be a wired connection. In these and otherembodiments, the second wireless network 152 may be a wired network.Alternatively or additionally, the third wireless network 154 may be awired network.

In some embodiments, the mobile device 110 and the audio output device120 may be communicatively coupled by one wireless or wired network. Inthese and other embodiments, audio data may be received by the audiooutput device 120, multiplexed, and sent back to the mobile device 110using the same wireless or wired network.

FIG. 2 illustrates example operations 200 in a system configured toprovide transcriptions of an audio communication session. The operations200 may be arranged in accordance with at least one embodiment describedin the present disclosure. The operations 200 illustrated are notexhaustive but are merely representative of operations 200 that mayoccur. Furthermore, one operation as illustrated may represent one ormore communications, operations, and/or data exchanges. Alternatively oradditionally, the ordering of the operations 200 may differ than thatillustrated in FIG. 2.

The operations 200 may occur between a mobile device 210, an audiooutput device 220, and a transcription system 240. In some embodiments,the mobile device 210, the audio output device 220, and thetranscription system 240 may be analogous to the mobile device 110, theaudio output device 120, and the transcription system 140, respectively,of FIG. 1. Accordingly, no further explanation is provided with respectthereto.

In some embodiments, the operations 200 may be an example ofcommunications and interactions between the mobile device 210, the audiooutput device 220, and the transcription system 240 that may occur overone or more networks. For example, the mobile device 210 may beconfigured to exchange data over three different wireless networks, acellular network, a Bluetooth network, and a WIFI network. In these andother embodiments, the mobile device 210 and the audio output device 220may be configured to exchange data over two different wireless networks,a Bluetooth network and a WIFI network. Lines representing operations200 that may occur or partially occur over a wireless network arerepresented differently depending on the wireless network used for thoseoperations 200. For example, in FIG. 2, a solid line may represent anoperation, such as a communication, over a Bluetooth network. A dashedline may represent an operation, such as a communication, over a WIFInetwork. A dash-dot line may represent an operation, such as acommunication, over a cellular network. Each of these networks may havedifferent protocols, operating frequencies, and/or data packetconfigurations.

The operations 200 may include obtaining 250 first audio data at themobile device 210 over a cellular network. The first audio data may beobtained from another device. In some embodiments, the first audio datamay be based on voice signals received at the other device during acommunication session between the mobile device 210 and the otherdevice.

A Bluetooth wireless connection may be established 252 between themobile device 210 and the audio output device 220. In these and otherembodiments, the mobile device 210 and the audio output device 220 mayexchange one or more packets of data following Bluetooth protocols toestablish the Bluetooth wireless connection. In these and otherembodiments, the Bluetooth audio connection may be established to streamaudio data from the mobile device 210 to the audio output device 220.

In some embodiments, the audio data streamed between the mobile device210 and the audio output device 220 may be streamed using an advancedaudio distribution profile (A2DP). In these and other embodiments, themobile device 210 may act as the source device and the audio outputdevice 220 may act as the sink device.

The mobile device 210 may transmit and the audio output device 220 mayreceive 254, the first audio data from the cellular network. In theseand other embodiments, the mobile device 210 may decode the first audiodata from the cellular network, obtain the data packets, and repackagethe data packets based on the Bluetooth protocol to transmit the audiodata to the audio output device 220 over the Bluetooth network.

After receiving the first audio data, the audio output device 220 maydecode the first audio data and multiplex the first audio data intofirst and second audio data streams. The first audio data stream may beoutput 255 at a speaker. In these and other embodiments, to output thefirst audio data stream, the first audio data stream may be sent to aspeaker that may act as a transducer to convert the analog or digitalfirst audio data stream to sound waves that may be heard by the user ofthe mobile device 210.

A WiFi wireless connection may be established 256 between the mobiledevice 210 and the audio output device 220. In these and otherembodiments, the mobile device 210 and the audio output device 220 mayexchange one or more packets of data following WiFi protocols toestablish the WiFi wireless connection. In these and other embodiments,the WiFi audio connection may be established to stream audio data fromthe mobile device 210 to the audio output device 220. In someembodiments, the WiFi wireless connection may be established using anaccess point of a WiFi wireless network. In these and other embodiments,the access point may act as a router to route data communicationsbetween the mobile device 210 and the audio output device 220.

The audio output device 220 may transmit and the mobile device 210 mayreceive 258, the second audio data stream over the WiFi network. Inthese and other embodiments, the audio output device 220 may encode thesecond audio data stream based on the WiFi protocol to transmit theaudio data to the mobile device 210 over the WiFi network. Afterreceiving the audio data, the mobile device 210 may decode the secondaudio data stream and prepare the second audio data stream fortransmission to the transcription system 240.

A network connection may be established 260 between the mobile device210 and the transcription system 240. In these and other embodiments,the mobile device 210 and the transcription system 240 may exchange oneor more packets of data to establish a connection between the mobiledevice 210 and the transcription system 240. For example, an applicationassociated with the transcription system 240 may be executed by themobile device 210. The application may include authentication protocolsto establish a network connection with a server of the transcriptionsystem 240 to allow for the exchange of information between the mobiledevice 210 and the transcription system 240.

The mobile device 210 may transmit and the transcription system 240 mayreceive 262 the second audio data stream over the cellular network. Inthese and other embodiments, the transcription system 240 may decode thesecond audio data stream from the cellular network, may obtain the datapackets, and may create a transcript 267 using the data packets. Thetranscription system 240 may transmit and the mobile device 210 mayreceive 268 the transcript over the cellular network.

The audio output device 220 may further generate second audio data. Forexample, the second audio data may be generated based on voice signalscaptured by the audio output device 220 from a user of the mobile device210 and the audio output device 220. The audio output device 220 mayencode the second audio data based on the Bluetooth connection. Theaudio output device 220 may transmit and the mobile device 210 mayreceive 264 the second audio data over the Bluetooth network. The mobiledevice 210 may decode the second audio data, may encode the second audiodata based on the cellular network protocols, and may transmit thesecond audio data 266 over the cellular network. The second audio datamay be part of a communication session between the mobile device 210 andthe other device.

The mobile device 210 may present the transcript 270 to the user of themobile device 210. For example, the mobile device 210 may display thetranscript on an electronic display of the mobile device 210. Bydisplaying the transcript, a user of the mobile device 210 may read andhear the first audio data.

The operations 200 may continue. For example, the operations 200 mayinclude obtaining 272 third audio data at the mobile device 210 over thecellular network. The third audio data may be obtained from the otherdevice. The mobile device 210 may transmit and the audio output device220 may receive 274, the third audio data from the cellular network.

After receiving the third audio data, the audio output device 220 maydecode the third audio data and may multiplex the third audio data intothird and fourth audio data streams. The third audio data stream may beoutput 276 at a speaker. The fourth audio data stream may be transmittedback to the mobile device 210 by the audio output device 220 over theWiFi network.

Modifications, additions, or omissions may be made to the operations 200without departing from the scope of the present disclosure. For example,in some embodiments, the order of the operations 200 may change. Forexample, the establishment of the Bluetooth network connection 252, theWiFi network connection 256, and the network connection 260 may occurbefore the first audio data is obtained 250. Alternatively oradditionally, the establishment of network connections may occur duringother times. Furthermore, the operations 200 may not occur in a linearfashion as illustrated, but may overlap in time such that they occur atsubstantially the same time. For example, some of the operations, suchas the transmitting and receiving of the second audio data between theaudio output device 220 and the mobile device 210 may occur while thefirst audio data is transmitted to the transcription system 240 from themobile device 210.

FIG. 3 illustrates another example system 300 configured to providetranscriptions of an audio communication session. The system 300 may bearranged in accordance with at least one embodiment described in thepresent disclosure. The system 300 may include a network 302, a mobiledevice 310, an audio output device 320, a first access point 330, asecond access point 332, and a transcription system 340. In someembodiments, the network 302, the mobile device 310, the audio outputdevice 320, the first access point 330, and the transcription system 340may be analogous to the network 102, the mobile device 110, the audiooutput device 120, the access point 130, and the transcription system140 respectively, of FIG. 1. Accordingly, no further explanation isprovided with respect thereto.

The mobile device 310 may be configured to transfer data over a firstwireless network 350, a second wireless network 352, and a thirdwireless network 354. The first wireless network 350 and the secondwireless network 352 may be analogous to the first wireless network 150and the second wireless network 152 of FIG. 1. Accordingly, no furtherexplanation is provided with respect thereto. As illustrated in FIG. 3,the mobile device 310 may communicate with the first access point 330over the first wireless network 350, may communicate with the audiooutput device 320 over the second wireless network 352, and maycommunicate with the second access point 332 over the third wirelessnetwork 354. In some embodiments, the first wireless network 350, thesecond wireless network 352, and the third wireless network 354 may bedifferent types of wireless networks.

The system 300 may operate in some respects in a manner analogous to thesystem 100 of FIG. 1. For example, the mobile device 310 may receiveaudio data over the first wireless network 350. The mobile device 310may send the audio data to the audio output device 320 over the secondwireless network 352. The audio output device 320 may multiplex theaudio data to send a first stream of the audio data to be output by theaudio output device 320 and to send a second stream of the audio data tothe transcription system 340.

In contrast to system 100, the system 300 may include the second accesspoint 332. The second access point 332 may be any suitable wirelessnetwork communication point for the third wireless network 354. Forexample, the third wireless network 354 may be a WiFi network and secondaccess point 332 may be a WiFi wireless router configured to establishthe third wireless network 354.

In some embodiments, the audio output device 320 may establish a networkconnection with the transcription system 340 through the third wirelessnetwork 354, the second access point 332, and the network 302. In theseand other embodiments, the audio output device 320 may send the secondstream of the audio data to the transcription system 340 by way of thesecond access point 332. In these and other embodiments, the mobiledevice 310 may not receive the second stream of the audio data.

In some embodiments, the transcript generated by the transcriptionsystem 340 may be provided to the mobile device 310 by way of the thirdwireless network 354. In these and other embodiments, the mobile device310 may establish a network connection with the transcription system 340through the third wireless network 354, the second access point 332, andthe network 302. After the transcription system 340 generates thetranscript of the audio data, the transcription system 340 may providethe audio data to the mobile device 310 for presentation.

In these and other embodiments, the audio output device 320 may beconfigured to establish the network connection with the transcriptionsystem 340 using the mobile device 310. For example, the mobile device310 may provide an IP address and authentication information to theaudio output device 320. The audio output device 320 may use the IPaddress and authentication information to establish the networkconnection with the transcription system 340. Alternatively oradditionally, the audio output device 320 may include a user interfaceor other connection with another device to obtain the information forthe network connection with the transcription system 340.

In some embodiments, the mobile device 310 may not have a networkconnection with the transcription system 340 by way of the thirdwireless network 354. Rather, a network connection with thetranscription system 340 may be established using the first wirelessnetwork 350. In these and other embodiments, the transcription system340 may provide the transcripts of audio data to the mobile device 310by way of the first access point 330 and the first wireless network 350.

Modifications, additions, or omissions may be made to the system 300without departing from the scope of the present disclosure. For example,in some embodiments, the transcription system 340 may provide thetranscript to the audio output device 320 through the third wirelessnetwork 354. In some embodiments, the audio output device 320 maypresent the transcript and/or the audio output device 320 may relay thetranscript to the mobile device 310.

FIG. 4 illustrates an example audio output device 400. The audio outputdevice 400 may be an example of the audio output device 120 of FIG. 1,the audio output device 220 of FIG. 2, and/or the audio output device320 of FIG. 3. The audio output device 400 may include a first antenna402, a second antenna 404, a microphone 406, a speaker 408, a firsttransceiver 410, a second transceiver 412, an operating system 420, aBluetooth protocol handler 430, a WiFi protocol handler 440, and anaudio codec 450.

The first antenna 402 may be configured to transmit and receive dataover a first wireless network. To transmit data, the first antenna 402may receive the data from the first transceiver 410. When receivingdata, the first antenna 402 may provide the received data to the firsttransceiver 410. The first transceiver 410 may be configured to handlethe transmission and receiving of data over the first antenna 402. Inthese and other embodiments, the first transceiver 410 may be configuredto encode and decode signals at the operating frequency of the firstwireless network.

The second antenna 404 may be configured to transmit and receive dataover a second wireless network. To transmit data, the second antenna 404may receive the data from the second transceiver 412. When receivingdata, the second antenna 404 may provide the received data to the secondtransceiver 412. The second transceiver 412 may be configured to handlethe transmission and receiving of data over the second antenna 404. Inthese and other embodiments, the second transceiver 412 may beconfigured to encode and decode signals at the operating frequency ofthe second wireless network.

The operating system 420 may include the Bluetooth protocol handler 430,the WiFi protocol handler 440, and the audio codec 450. The Bluetoothprotocol handler 430 may be configured to handle the wireless protocolsfor the first wireless network. For example, the Bluetooth protocolhandler 430 may handle the signaling protocols to establish and maintainthe first wireless network. Alternatively or additionally, the Bluetoothprotocol handler 430 may be configured to encode data into data packetsfor transmission over the first wireless network as well as to extractdata from data packets received over the first wireless network.

The WiFi protocol handler 440 may be configured to handle the wirelessprotocols for the second wireless network. For example, the WiFiprotocol handler 440 may handle the signaling protocols and the encodingand the decoding of data packets for the second wireless network as wellas to extract data from data packets received over the second wirelessnetwork.

The microphone 406 may be configured to receive sound waves and toconvert the sound waves to analog audio data. The analog audio data maybe provided to the audio codec 450. The speaker 408 may be configured toreceive analog audio data from the audio codec 450 and to convert theanalog audio data to sound waves that are broadcast from the audiooutput device 400.

The audio codec 450 may be configured to receive analog audio data fromthe microphone 406 and to convert the analog audio data to digital audiodata. The audio codec 450 may be further configured to process thedigital and/or analog audio data from the microphone 406. In someembodiments, the audio codec 450 may multiplex the audio data after itis received from the microphone 406 into a first stream and a secondstream. In these and other embodiments, the audio codec 450 may providethe first stream to the Bluetooth protocol handler 430 and may providethe second stream to the WiFi protocol handler 440 for transmission bythe first antenna 402 and the second antenna 404, respectively.Alternatively or additionally, the audio codec 450 may be configured topass the audio data without multiplexing the audio data from themicrophone 406 to the Bluetooth protocol handler 430 for transmission bythe first antenna 402.

The audio codec 450 may be further configured to receive digital audiodata from the Bluetooth protocol handler 430. The digital audio data maybe received over the first network by the first antenna 402. The audiocodec 450 may multiplex the digital audio data into a first stream and asecond stream. The first stream may be provided to the WiFi protocolhandler 440 for encoding and transmission over the second network by wayof the second antenna 404. The second stream may be decompressed basedon a compression scheme applied to the digital audio data. Afterdecompression, the digital audio data may be converted to analog audiodata and provided to the speaker 408.

In some embodiments, the audio codec 450 may multiplex digital audiodata by making a copy of the digital audio data. Alternatively oradditionally, the audio codec 450 may multiplex the digital audio databy driving the digital audio data received by the Bluetooth protocolhandler 430 and stored in the audio codec 450 to two differentlocations, namely, the speaker 408 and the WiFi protocol handler 440.For example, the digital audio data may be received by the audio codec450 from the Bluetooth protocol handler 430. The audio codec 450 maydirect the digital audio data to the WiFi protocol handler 440.Additionally, the audio codec 450 may direct the digital audio data tobe decoded based on the compression scheme of the digital audio data, tobe converted to analog audio data, and to be sent to the speaker 408.

Modifications, additions, or omissions may be made to the audio outputdevice 400 without departing from the scope of the present disclosure.For example, in some embodiments, the audio output device 400 mayinclude additional components, such as a display. Alternatively oradditionally, the audio output device 400 may include fewer components.For example, in some embodiments, the audio output device 400 mayinclude a single antenna for both the first and second wirelessnetworks. As another example, the audio output device 400 may includealternative components. For example, the Bluetooth protocol handler 430and the WiFi protocol handler 440 may be performed by a processor, suchas a processor discussed with respect to FIG. 5.

FIG. 5 illustrates an example computing system 500 that may be used inobtaining VoIP credentials for a device. The system 500 may be arrangedin accordance with at least one embodiment described in the presentdisclosure. The system 500 may include a processor 510, a memory 512, adata storage 514, a communication unit 516, an electronic display 518, auser interface unit 520, and a peripheral device 522. In someembodiments, the system 500 may be part of any of the systems or devicesdescribed in this disclosure. For example, the system 500 may be part ofthe mobile device 110, the audio output device 120, and/or thetranscription system 140 of FIG. 1.

Generally, the processor 510 may include any suitable special-purpose orgeneral-purpose computer, computing entity, or processing deviceincluding various computer hardware or software modules and may beconfigured to execute instructions stored on any applicablecomputer-readable storage media. For example, the processor 510 mayinclude a microprocessor, a microcontroller, a digital signal processor(DSP), an application-specific integrated circuit (ASIC), aField-Programmable Gate Array (FPGA), or any other digital or analogcircuitry configured to interpret and/or to execute program instructionsand/or to process data.

Although illustrated as a single processor in FIG. 5, it is understoodthat the processor 510 may include any number of processors distributedacross any number of networks or physical locations that are configuredto perform individually or collectively any number of operationsdescribed herein. In some embodiments, the processor 510 may interpretand/or execute program instructions and/or process data stored in thememory 512, the data storage 514, or the memory 512 and the data storage514. In some embodiments, the processor 510 may fetch programinstructions from the data storage 514 and load the program instructionsinto the memory 512.

After the program instructions are loaded into the memory 512, theprocessor 510 may execute the program instructions. For example, thesystem 500 may be part of the mobile device 110 of FIG. 1. In these andother embodiments, the instructions may include the processor 510instructing the communication unit 516 to establish a network connectionwith the audio output device 120 over the second and the third wirelessnetworks 152 and 154. Alternatively or additionally, the instructionsmay include the processor 510 instructing the communication unit 516 totransmit audio data to the transcription system 140. Alternatively oradditionally, the instructions may include the processor 510 instructingthe electronic display 518 to display a transcript received from thetranscription system 140. Alternatively or additionally, theinstructions may include the processor 510 executing instructions for anapplication that communicates with the transcription system 140 asdescribed.

As another example, the system 500 may be part of the audio outputdevice 120. In these and other embodiments, the instructions may includethe processor 510 instructing the encoding and decoding of audio datafor both of the second and the third wireless networks 152 and 154.

The memory 512 and the data storage 514 may include computer-readablestorage media or one or more computer-readable storage mediums forcarrying or having computer-executable instructions or data structuresstored thereon. Such computer-readable storage media may be anyavailable media that may be accessed by a general-purpose orspecial-purpose computer, such as the processor 510. By way of example,and not limitation, such computer-readable storage media may includenon-transitory computer-readable storage media including Random AccessMemory (RAM), Read-Only Memory (ROM), Electrically Erasable ProgrammableRead-Only Memory (EEPROM), Compact Disc Read-Only Memory (CD-ROM) orother optical disk storage, magnetic disk storage or other magneticstorage devices, flash memory devices (e.g., solid state memorydevices), or any other storage medium which may be used to carry orstore desired program code in the form of computer-executableinstructions or data structures and which may be accessed by ageneral-purpose or special-purpose computer. Combinations of the abovemay also be included within the scope of computer-readable storagemedia. In these and other embodiments, the term “non-transitory” as usedherein should be construed to exclude only those types of transitorymedia that were found to fall outside the scope of patentable subjectmatter in the Federal Circuit decision of In re Nuijten, 500 F.3d 1346(Fed. Cir. 2007). In some embodiments, computer-executable instructionsmay include, for example, instructions and data configured to cause theprocessor 510 to perform a certain operation or group of operations,such as one or more of the operations 200, one or more blocks of themethod 600, and/or one or more blocks of the method 700.

The communication unit 516 may include any component, device, system, orcombination thereof that is configured to transmit or receiveinformation over a network. In some embodiments, the communication unit516 may communicate with other devices at other locations, the samelocation, or even other components within the same system. For example,the communication unit 516 may include a modem, a network card (wirelessor wired), an infrared communication device, a wireless communicationdevice (such as an antenna), and/or chipset (such as a Bluetooth device,an 802.6 device (e.g., Metropolitan Area Network (MAN)), a WiFi device,a WiMax device, cellular communication facilities, etc.), and/or thelike. The communication unit 516 may permit data to be exchanged with anetwork and/or any other devices or systems described in the presentdisclosure.

The electronic display 518 may be configured as one or more electronicdisplays, like an LCD, LED, or other type display. The electronicdisplay 518 may be configured to present video, text captions, userinterfaces, and other data as directed by the processor 510.

The user interface unit 520 may include any device to allow a user tointerface with the system 500. For example, the user interface unit 520may include a mouse, a track pad, a keyboard, and/or a touchscreen,among other devices. The user interface unit 520 may receive input froma user and provide the input to the processor 510.

The peripheral devices 522 may include one or more devices. For example,the peripheral devices may include a microphone, an imager, and/or aspeaker, among other peripheral devices. In these and other embodiments,the microphone may be configured to capture audio. The imager may beconfigured to capture digital images. The digital images may be capturedin a manner to produce video or image data. In some embodiments, thespeaker may broadcast audio received by the system 500 or otherwisegenerated by the system 500.

Modifications, additions, or omissions may be made to the system 500without departing from the scope of the present disclosure. For example,when the system 500 is part of the audio output device 120 of FIG. 1,the system 500 may not include one or more of the electronic display518, the peripheral devices 522, or other aspects of the system 500.

FIG. 6 is a flowchart of an example computer-implemented method toprovide transcriptions of an audio communication session. The method 600may be arranged in accordance with at least one embodiment described inthe present disclosure. The method 600 may be performed, in whole or inpart, in some embodiments by a system, such as the system 100, thesystem 300, the audio output device 400, and/or the system 500 of FIGS.1, 3, 4, and 5, respectively. In these and other embodiments, the method600 may be performed based on the execution of instructions stored onone or more non-transitory computer-readable media. Although illustratedas discrete blocks, various blocks may be divided into additionalblocks, combined into fewer blocks, or eliminated, depending on thedesired implementation.

The method 600 may begin at block 602, where audio data may be receivedat an audio output device from a mobile device over a Bluetooth wirelessnetwork connection. In some embodiments, the audio data may be receivedby the mobile device over a cellular wireless network. In someembodiments, the mobile device may be a smartphone, laptop, or tablet.In some embodiments, the audio output device may be a Bluetooth wirelessheadset that is paired with the mobile device. In some embodiments, theaudio data may be provided from the mobile device to the transcriptionsystem at least partially over the cellular wireless network.

In block 604, the audio data may be provided to a speaker of the audiooutput device for output of the audio data. In block 606, the audio datamay be encoded at the audio output device based on an IEEE 802.11protocol.

In block 608, the encoded audio data may be transmitted from the audiooutput device to the mobile device over an IEEE 802.11 wireless network.In block 610, the audio data transmitted over the IEEE 802.11 wirelessnetwork may be obtained from the mobile device at a transcriptionsystem.

In block 612, a transcript of the audio data may be generated at thetranscription system. In block 614, the transcript may be provided tothe mobile device for output of the transcript on the mobile device. Insome embodiments, the transcript of the audio data may be provided inreal time.

One skilled in the art will appreciate that, for this and otherprocesses, operations, and methods disclosed herein, the functionsand/or operations performed may be implemented in differing order.Furthermore, the outlined functions and operations are only provided asexamples, and some of the functions and operations may be optional,combined into fewer functions and operations, or expanded intoadditional functions and operations without detracting from the essenceof the disclosed embodiments.

FIG. 7 is a flowchart of another example computer-implemented method toprovide transcriptions of an audio communication session. The method 700may be arranged in accordance with at least one embodiment described inthe present disclosure. The method 700 may be performed, in whole or inpart, in some embodiments by a system, such as the system 100, thesystem 300, the audio output device 400, and/or the system 500 of FIGS.1, 3, 4, and 5, respectively. In these and other embodiments, the method700 may be performed based on the execution of instructions stored onone or more non-transitory computer-readable media. Although illustratedas discrete blocks, various blocks may be divided into additionalblocks, combined into fewer blocks, or eliminated, depending on thedesired implementation.

The method 700 may begin at block 702, where audio data may be receivedat a first device from a second device over a first wireless networkconnection of a first wireless network type. In some embodiments, theaudio data may be received by the second device over a cellular wirelessnetwork.

In block 704, the audio data may be provided to a speaker of the firstdevice for output of the audio data. In block 706, the audio data may beencoded at the first device based on a second wireless network type. Insome embodiments, the first wireless network type may be a Bluetoothwireless network and the second wireless network type may be an IEEE802.11 wireless network.

In block 708, the encoded audio data may be transmitted from the firstdevice over a second wireless network of the second wireless networktype. In some embodiments, the audio data may be transmitted from thefirst device over the second wireless network to the second device.Alternatively or additionally, the audio data may be transmitted fromthe first device over the second wireless network to a network accesspoint of the second wireless network.

One skilled in the art will appreciate that, for this and otherprocesses, operations, and methods disclosed herein, the functionsand/or operations performed may be implemented in differing order.Furthermore, the outlined functions and operations are only provided asexamples, and some of the functions and operations may be optional,combined into fewer functions and operations, or expanded intoadditional functions and operations without detracting from the essenceof the disclosed embodiments.

For example, in some embodiments, the method 700 may further includeobtaining, at a transcription system, the audio data transmitted overthe second wireless network and generating, at the transcription system,a transcript of the audio data.

Alternatively or additionally, the method 700 may further includeproviding the transcript to the second device for output of thetranscript on the second device. In these and other embodiments, thetranscript of the audio data may be provided in real time. Alternativelyor additionally, the audio data may be obtained at the transcriptionsystem by way of the second device that receives the encoded audio databy way of the second wireless network. In these and other embodiments,the audio data may be obtained at the transcription system by way of anetwork access point of the second wireless network that receives theencoded audio data by way of the second wireless network.

As indicated above, the embodiments described herein may include the useof a special purpose or general purpose computer (e.g., the processor510 of FIG. 5) including various computer hardware or software modules,as discussed in greater detail below. Further, as indicated above,embodiments described herein may be implemented using computer-readablemedia (e.g., the memory 512 or data storage 514 of FIG. 5) for carryingor having computer-executable instructions or data structures storedthereon.

In some embodiments, the different components, modules, engines, andservices described herein may be implemented as objects or processesthat execute on a computing system (e.g., as separate threads). Whilesome of the systems and methods described herein are generally describedas being implemented in software (stored on and/or executed by generalpurpose hardware), specific hardware implementations or a combination ofsoftware and specific hardware implementations are also possible andcontemplated.

In accordance with common practice, the various features illustrated inthe drawings may not be drawn to scale. The illustrations presented inthe present disclosure are not meant to be actual views of anyparticular apparatus (e.g., device, system, etc.) or method, but aremerely idealized representations that are employed to describe variousembodiments of the disclosure. Accordingly, the dimensions of thevarious features may be arbitrarily expanded or reduced for clarity. Inaddition, some of the drawings may be simplified for clarity. Thus, thedrawings may not depict all of the components of a given apparatus(e.g., device) or all operations of a particular method.

Terms used herein and especially in the appended claims (e.g., bodies ofthe appended claims) are generally intended as “open” terms (e.g., theterm “including” should be interpreted as “including, but not limitedto,” the term “having” should be interpreted as “having at least,” theterm “includes” should be interpreted as “includes, but is not limitedto,” etc.).

Additionally, if a specific number of an introduced claim recitation isintended, such an intent will be explicitly recited in the claim, and inthe absence of such recitation no such intent is present. For example,as an aid to understanding, the following appended claims may containusage of the introductory phrases “at least one” and “one or more” tointroduce claim recitations. However, the use of such phrases should notbe construed to imply that the introduction of a claim recitation by theindefinite articles “a” or “an” limits any particular claim containingsuch introduced claim recitation to embodiments containing only one suchrecitation, even when the same claim includes the introductory phrases“one or more” or “at least one” and indefinite articles such as “a” or“an” (e.g., “a” and/or “an” should be interpreted to mean “at least one”or “one or more”); the same holds true for the use of definite articlesused to introduce claim recitations.

In addition, even if a specific number of an introduced claim recitationis explicitly recited, those skilled in the art will recognize that suchrecitation should be interpreted to mean at least the recited number(e.g., the bare recitation of “two recitations,” without othermodifiers, means at least two recitations, or two or more recitations).Furthermore, in those instances where a convention analogous to “atleast one of A, B, and C, etc.” or “one or more of A, B, and C, etc.” isused, in general such a construction is intended to include A alone, Balone, C alone, A and B together, A and C together, B and C together, orA, B, and C together, etc. For example, the use of the term “and/or” isintended to be construed in this manner.

Further, any disjunctive word or phrase presenting two or morealternative terms, whether in the description, claims, or drawings,should be understood to contemplate the possibilities of including oneof the terms, either of the terms, or both terms. For example, thephrase “A or B” should be understood to include the possibilities of “A”or “B” or “A and B.”

However, the use of such phrases should not be construed to imply thatthe introduction of a claim recitation by the indefinite articles “a” or“an” limits any particular claim containing such introduced claimrecitation to embodiments containing only one such recitation, even whenthe same claim includes the introductory phrases “one or more” or “atleast one” and indefinite articles such as “a” or “an” (e.g., “a” and/or“an” should be interpreted to mean “at least one” or “one or more”); thesame holds true for the use of definite articles used to introduce claimrecitations.

Additionally, the use of the terms “first,” “second,” “third,” etc., arenot necessarily used herein to connote a specific order or number ofelements. Generally, the terms “first,” “second,” “third,” etc., areused to distinguish between different elements as generic identifiers.Absence a showing that the terms “first,” “second,” “third,” etc.,connote a specific order, these terms should not be understood toconnote a specific order. Furthermore, absence a showing that the termsfirst,” “second,” “third,” etc., connote a specific number of elements,these terms should not be understood to connote a specific number ofelements. For example, a first widget may be described as having a firstside and a second widget may be described as having a second side. Theuse of the term “second side” with respect to the second widget may beto distinguish such side of the second widget from the “first side” ofthe first widget and not to connote that the second widget has twosides.

All examples and conditional language recited herein are intended forpedagogical objects to aid the reader in understanding the invention andthe concepts contributed by the inventor to furthering the art, and areto be construed as being without limitation to such specifically recitedexamples and conditions. Although embodiments of the present disclosurehave been described in detail, it should be understood that the variouschanges, substitutions, and alterations could be made hereto withoutdeparting from the spirit and scope of the present disclosure.

What is claimed is:
 1. A computer-implemented method to providetranscriptions of an audio communication session, the method comprising:receiving audio data at an audio output device from a first device overa first wireless network, the audio data received by the first deviceover a second wireless network from a remote device that generated theaudio data and the second wireless network is a different type ofwireless network than the first wireless network; multiplexing the audiodata to generate first multiplexed audio data and second multiplexedaudio data; providing the first multiplexed audio data to a speaker ofthe audio output device for output of the first multiplexed audio data;transmitting, by the audio output device, the second multiplexed audiodata to a transcription system, the transcription system configured togenerate a transcript of the second multiplexed audio data; obtaining,at the audio output device, the transcript from the transcriptionsystem; and presenting, by the audio output device, the transcript. 2.The method of claim 1, wherein the first device is a smartphone, alaptop, or a tablet.
 3. The method of claim 1, wherein the secondmultiplexed audio data is transmitted to the transcription system over athird wireless network and the third wireless network is a differenttype of wireless network than the first wireless network and the secondwireless network.
 4. The method of claim 3, wherein the third wirelessnetwork is a IEEE 802.11 wireless network and the second wirelessnetwork is a cellular wireless network.
 5. The method of claim 1,wherein the first wireless network is a BLUETOOTH wireless network andthe audio output device is paired with the first device.
 6. The methodof claim 1, wherein the first multiplexed audio data and the secondmultiplexed audio data are the same as the audio data.
 7. The method ofclaim 1, wherein the transcript is presented in substantially real timewith the output of the first multiplexed audio data.
 8. Acomputer-implemented method to provide transcriptions of an audiocommunication session, the method comprising: receiving audio data at afirst device over a first network; providing, from the first device, theaudio data to a second device over a second network that is differentthan the first network; transmitting, to a transcription system from thesecond device, the audio data over a third network, the third network isa different type of network than the first network and the secondnetwork and the transcription system is configured to generate atranscript of the audio data; obtaining, at the second device from thetranscription system by way of the third network, the transcript of theaudio data; providing the transcript to the first device from the seconddevice by way of the second network; and presenting the transcript bythe first device.
 9. The method of claim 8, further comprising:multiplexing the audio data to generate first multiplexed audio data andsecond multiplexed audio data; and providing the first multiplexed audiodata to a speaker for output of the first multiplexed audio data. 10.The method of claim 9, wherein the audio data transmitted to thetranscription system is the second multiplexed audio data.
 11. Themethod of claim 8, wherein the second network is a IEEE 802.11 wirelessnetwork.
 12. The method of claim 8, wherein the second network is aBLUETOOTH wireless network.
 13. The method of claim 8, wherein the firstnetwork is a cellular wireless network.
 14. The method of claim 8,wherein the transcript is presented in substantially real time withoutput of the audio data.
 15. A computer-implemented method to providetranscriptions of an audio communication session, the method comprising:receiving audio data at a first device that is generated by a remotedevice; providing, from the first device, the audio data to a seconddevice over a first network; transmitting, to a transcription systemfrom the second device, the audio data over a second network, the secondnetwork is a different type of network than the first network and thetranscription system is configured to generate a transcript of the audiodata; obtaining, at the second device from the transcription system byway of the second network, the transcript of the audio data; providingthe transcript to the first device from the second device by way of thefirst network; and presenting the transcript by the first device. 16.The method of claim 15, further comprising: multiplexing the audio datato generate first multiplexed audio data and second multiplexed audiodata; and providing the first multiplexed audio data to a speaker foroutput of the first multiplexed audio data.
 17. The method of claim 15,wherein the audio data is received at the first device over a thirdnetwork.
 18. The method of claim 17, wherein the third network is adifferent type of network than the first network and the second network.19. The method of claim 17, wherein the second network and the thirdnetwork are both cellular wireless networks.
 20. The method of claim 15,wherein the first network is a IEEE 802.11 wireless network or aBLUETOOTH wireless network.